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Building an Interim Assessment System: A Workbook for School Districts 



How to Use This Workbook 

As someone with a stake in a school district’s systems, you probably do not have all the answers around what is necessary to build an 
effective interim assessment system. Neither does this workbook. But it is intended to have the right questions. More precisely, this 
workbook contains the vision, infrastructure, and resource questions critical to a thorough, district-level self-examination of readiness for 
an interim assessment system. Staff at state departments of education can also use this resource when district-level staff approach the state 
agency seeking guidance. These questions are intended to guide you and other district stakeholders through a structured consideration of 
building or revisiting your district’s interim assessment system. The questions are intended to be answered collaboratively, not by a district 
administrator in isolation. The authors of this workbook believe that a team of district stakeholders addressing these questions is the 
approach most likely to yield progress in creating a lasting, successful system. 

The workbook is organized into the following nine sections: 

• Goals and Vision 

• Elements and Features 

• Professional Development 

• Alignment to Standards and Curriculum 

• Costs and Capacity 

• Test Development and Technical Quality 

• Administration and Scoring 

• Reporting 

• Evaluation 

A Definition of Interim Assessments 

In a paper commissioned by the Council of Chief State School Officers’ (CCSSO) collaborative on Technical Issues in Large-Scale 
Assessment (TILSA), Crane (2008) offers the following definition of interim assessments: 

Assessments administered multiple times during a school year, usually outside of instruction, to 
evaluate students’ knowledge and skills relative to a specific set of academic goals in order to inform 
policymaker or educator decisions at the student, classroom, school, or district level. The specific 
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interim assessment designs are driven by the purposes and intended uses, but the results of any 
interim assessment must be reported in a manner allowing aggregation across students, occasions, or 
concepts, (p. 2) 

This definition builds on earlier work by Perie, Marion, and Gong (2007). This revised definition emphasizes that interim assessment is 
periodic in nature, and with regard to both stakes and frequency, falls between classroom-level, low-stakes, high-frequency formative 
assessment and state-level, high-stakes, low-frequency summative assessment. Links to these resources, as well as updated work by Perie 
and her colleagues (2009), are available in Appendix A. 





Formative 


Interim 


Summative 


Typical Use 


feedback to adjust ongoing 
teaching and learning 


monitoring student progress 


student placement; school 
and district accountability 


Frequency of 
Administration 


continual; multiple times a 
day 


generally 2—6 times per 
school year 


usually once a school year 


Scope of 
Administration 


student and classroom 


usually school or district 
(could be student, as in a 
response to instruction and 
intervention model) 


usually state 



Source: Crane, 2008, p. 4. 



School districts are at different stages in their thinking and planning concerning interim assessment. Some are considering implementing 
their first-ever system. Others are revisiting an established system. The questions in this workbook address both the system and the items 
and tests that make up the system. Most school districts can benefit from looking at all the sections of this workbook; however, two 
sections are more technical than the rest. Districts just beginning the process may want to defer a serious consideration of the “Alignment 
to Standards and Curriculum” and “Test Development and Technical Quality” sections until they have built a more substantial foundation 
to support an interim assessment system. The first section, “Goals and Vision,” is in our view the most important. Discussions on an 
interim assessment system need to solidify and establish consensus on goals and vision before proceeding to other topics. 

The Council of Chief State School Officers’ (CCSSO) collaborative on Technical Issues in Large-Scale Assessment (TILSA) authored this 
workbook, with Eric Crane of WestEd as lead author. This TILSA group geared the workbook to a universal level. Some of your questions 
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may not be addressed within these pages. However, we hope and expect that a self-examination guided by this workbook will prove useful 
to a diverse array of school districts. 
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GOALS AND VISION 



Question 


Relevance / Follow-up Questions 


1 . What is the purpose of the 
interim assessment system? What 
are our goals for this program? 


This is the central question of this process. 
The importance of this question cannot be 
overstated. 

Crane (2008) identified eight uses of interim 
assessments: diagnosis, prediction, preparation, 
placement, student evaluation, school intervention, 
promotion/graduation, and local accountability 
(see Appendix B). Which of these apply to the 
district, and how will they be addressed? 

□ Diagnosis 

□ Prediction 

□ Preparation 

□ Placement 

□ Student evaluation/monitoring student 

progress 

□ School intervention 

□ Promotion/graduation 

□ Local accountability 

Are the uses supported by appropriate 
infrastructure or technical work (e.g., if predictive 
validity is a use we will rely on, then technical 
studies must be carried out to support such use)? 
Note that with each additional use to which the 
district will put the data, additional technical work 
or infrastructure will need to be in place. 



Notes / Response 
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GOALS AND VISION 

Question Relevance / Follow-up Questions Notes / Response 

2. What action steps will be taken as Are there curricular materials at hand that can 

a result of this assessment? facilitate re-teaching? Will the results be used to 

evaluate teachers? Can such use be justified by the 
assessment? 

3. How will student learning and A clear articulation of the valued outcomes — and 

other valued outcomes be affected what will change to promote these outcomes — is 
as a result of using this interim essential to principals’ and teachers’ (and other 
assessment system? stakeholders’) understanding, support, and 

implementation of the system. Identifying specific 
examples of how student learning will be affected 
is essential. 

4. Who will use the information Realistically, there will be discomfort and resistance 
gathered from this assessment? to a new system. The system will need champions 
What buy-in or sense of ownership who can explain the reasons for moving to the new 
exists from various stakeholders and system and the system’s benefits. This can promote 
what will be needed to make this buy-in. 

program successful and of value? 

5. Can we explain how the interim The system should provide important information 

assessment system fits into the that is more standardized than the formative 
larger landscape of other assessment information that teachers constantly 

assessments our students take? generate. On the other hand, the interim 

assessment information should point to “next 
steps” for learning and instruction, which is not a 
primary purpose of the end-of-year summative 
assessment. 

6. What is the scope of the A matrix showing content coverage by grade level 

program? Grades? Content areas? could help stakeholders see the potential breadth of 
Would phasing in the program, the program. 

initially including only certain grades 
or content areas, make sense? 
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GOALS AND VISION 

Question Relevance / Follow-up Questions Notes / Response 

7. Have principles of universal Look for assessment materials that developers have 

design been incorporated to ensure designed for use by all populations. If developing 
that the needs of special your own assessment materials, make sure that 

populations are addressed? expertise on universal design is available. 

8. Do we want to consider Can we achieve better alignment than if we 

developing our system internally, or purchased a product? Do we have the capacity, 
do we want to purchase a vendor resources, and funding to develop a product? Do 
product? we have the funding to purchase a product? 

9. Are content standards in place Are there district-level content standards and 
and is this program or assessment in objectives that need to be considered along with 
support of those standards? (See state standards? 

also “Alignment to Standards and 
Curriculum” section.) 

10. Have we remembered to reflect This applies when districts are purchasing a test or 

the relevant questions from this releasing an RFP for test or item development 
document in our request for services such as online testing, custom test 

proposal (RFP)? Is the vendor development, scoring and reporting, and item 
willing to provide references from development. RFPs include performance 

other school districts? requirements, legal statements, and many other 

elements that are beyond the scope of this 
workbook. 

Before you leave this section, verify that there is consensus around the goals and vision for the interim assessment system. 
Decisions about goals and vision should serve as guiding touchstones for all later work. As you answer questions in the 
following sections of the workbook, refer back to this “Goals and Vision” section to inform your thinking and decision making. 
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ELEMENTS AND FEATURES 

Question Relevance / Follow-up Questions Notes / Response 

1. Can the assessment show student Student growth can be defined in various ways, 

growth? (See also “Reporting” Measuring growth within a year is generally more 
section.) straightforward than measuring growth across 

years. It may cost more to track growth across 
years. Growth within a year may be less expensive 
to track, but it may not yield the information that 
the district needs. 

Does the vendor claim that there is a vertical scale? 

Is there statistical information to justify such a 
claim? 

2. Is the assessment valid for What technical work supports this? 

different student populations? (See 

also “Test Development and 
Technical Quality” section.) 

3. Does the assessment provide How much and what kinds of evidence? 
evidence of strengths and 

weaknesses at the individual student 
and group levels? 

4. Does the assessment come with Does the system offer any sort of next steps based 

instructional materials and on results? If instructional materials represent a 

strategies? If not, does it link to separate component, what is their cost? What is 

instructional resources? Are the their quality? 

instructional materials and strategies 
linked to state standards or grade- 
level expectations? 

5. How do these interim Teachers feel more positively about assessments 

assessments fit within our local and when it is clear how they fit together in a balanced 
state assessment system and how do assessment system. 
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ELEMENTS AND FEATURES 


Question 


Relevance / Follow-up Questions 


Notes / Response 


they link/ connect to/ support the 
other components in our system? 






6. For vendor purchases, is online 
or telephone technical assistance 
available and provided in a timely 
manner? 


Especially when implementing a new system, 
support can be critical. Is the vendor willing to 
provide references of actual customers (other 
districts) who could address this question? 
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PROFESSIONAL DEVELOPMENT 

Question Relevance / Follow-up Questions Notes / Response 

1. What professional development What stakeholders need PD? Will the PD offered 

(PD) will various stakeholders to different stakeholders be tailored to that group? 

want/ need? How will that PD help If so, how? 

us ensure that the data from the 
tests are used correcdy and 
effectively? 

2. How compatible is the Assessment that is coordinated with ongoing PD is 

assessment with pre-existing or more likely to be successful. 

ongoing professional development 
provided or supported by the 
district? 

3. Does the assessment system Training on how to use the results is critical, 
include professional development Are teachers or trainers from the district expected 
(PD)? If so, is the PD about how to to provide the PD? 

use the product only, or does the 
PD include how to use the results? 

4. Have we planned for adequate Does the PD promote or reinforce buy-in? 

professional development early, There will be discomfort and resistance to a new 
including prior to adoption of the system. Scheduling professional development early 
system, if applicable? can increase comfort with the system. In the best 

case, staff who had been oppositional can become 
advocates (Crane, 2008). 
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ALIGNMENT TO STANDARDS AND CURRICULUM 

Question Relevance / Follow-up Questions Notes / Response 

1. How aligned is the assessment to If the alignment to state content standards is not as 

state academic content standards? good as desired, how should results and inferences 
Do we want the assessment to be be discussed differently? Can the vendor indicate to 
aligned to any other standards in what standard the items are aligned? What happens 
addition? and what are the consequences if the assessment is 

less than fully aligned? 

2. Has the vendor conducted an Review the vendor’s materials to verify claims. Ask 

alignment process? Do we want to for additional details if materials are vague, 
sponsor or conduct our own How was the alignment conducted? Will the 

alignment study? vendor be responsive to findings from an item 

review by the district’s teachers? 

3. If there is an alignment study, are Besides topics, is the cognitive complexity or depth 

the following elements published: of knowledge published? 

the number and percentage of items 
aligned to the standards, the 
number and percentage of 
standards for which there are no 
items, and the percentage of items 
that do not align to the standards? 

4. Do the assessments reflect a Have the state and local curriculum standards been 

logical scope and sequence in examined to ensure that what is being assessed is 

curriculum? being taught? Is there a mechanism in place for 

teacher committees to study the assessment results 
and make curriculum recommendations or 
modifications? 

5. How do we want the assessments Some purchased interim assessments may not 
to “link” or connect to our existing match the timing of items in the curriculum, 
textbooks and curriculum 

materials/ resources? 
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COSTS AND CAPACITY 

Question Relevance / Follow-up Questions Notes / Response 

1. What is the cost of the system? Is Are additional resources needed to get full benefit 

the cost in line with the value of the from the system? Cost includes opportunity cost of 
system and with its goals? other things the district could have done, 

instructional time, resources, staff time, and 
computer time. 

2. Can we afford it? It is sometimes better to envision the ideal before 

focusing on reality. It may help with the visioning 
process. If keeping costs down is paramount, are 
there possibilities for collaboration with other 
districts? 

3. Do we have the staff time Are the likely benefits so high that we will find the 

available for training in a new time, even if it appears that we do not have it 

system? available? 

4. Do we have the technical The vendor may not be the only option for a 

expertise to manage certain parts of consultant to help manage the system. 

the system ourselves? 

5. Is this an online system hosted by Does the district need servers? Scanners? How well 

the vendor? equipped are school sites? It is important to include 

information technology staff in the planning and 
discussion. 
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This section, “Test Development and Technical Quality,” contains more technical information than other sections of the 
workbook. Not all school districts will have in-house expertise regarding the technical issues of assessment development. We 
encourage school districts to work with qualified psychometricians or other advisors if the districts need additional technical 
expertise. 

TEST DEVELOPMENT AND TECHNICAL QUALITY 

Question Relevance / Follow-up Questions Notes / Response 

1. What type(s) of items do we want Does the assessment include tasks or 

in the tests? performances? This is especially important if the 

state assessments emphasize these formats and the 
interim assessments are meant to monitor readiness 
for those exams. 

2. What access will various If items are secure, this will impact remediation, 

stakeholders have to the items? Will Consider publishing release and exposure policy on 
items be released? the district’s web site. 

3. How much involvement do we Do we want/ need tests that are pre-made, or do 

want to have in test development? we want a bank of items so that we can build our 

own assessments? Does the district have the 
necessary technology and professional expertise to 
support teacher-developed assessments? Does the 
system allow for item development by teachers? 

4. How much staff time do we have Would other administration or scoring models 
to conduct the tasks required by the require less (or more) time? 
administration? (See also 

“Administration and Scoring” 
section.) 

5. Is the interim assessment What kinds of data would back up claims of 

predictive of other important predictive validity for interim assessments? Be a 

measures or outcomes? critical consumer of vendor examples, asking 

questions about the evidence. 

6. How relevant is the information Sometimes high-quality testimonials and quasi- 
that was in the documentation of experimental studies are more relevant than a gold- 
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TEST DEVELOPMENT AND TECHNICAL QUALITY 

Question Relevance / Follow-up Questions Notes / Response 

technical quality? What level of standard research design (such as that required by 
technical rigor do we require? the federal What Works Clearinghouse). 

7. How rigorous is the technical There is considerable range of district practice 
foundation of the assessment? here. A gold-standard level of demonstrated 

technical quality will be appropriate for some 
districts. Other districts may be comfortable 
without the same level of demonstrated technical 
quality. 

Examples include: 

• detailed study design 

• links to instruction 

• thorough statistical and psychometric 
information 

• evidence that each item was edited for spelling, 

grammar, usage conventions, and cuing and item 
writing principles (adapted from Siskind and 
Potter, 2006; a more complete listing of Siskind 
and Potter’s elements of a rigorous technical 
foundation is in Appendix C) 
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ADMINISTRATION AND SCORING 

Question Relevance / Follow-up Questions Notes / Response 

1 . How often will we administer the Will retesting be allowed with the same items? If 

tests? How are the limitations of yes, then item exposure may be a problem: 

our calendar and our equipment exposure may be a problem if accountability is a 

matched to these issues? purpose, but retesting may be advisable 

instructionally, if preparation is a purpose. 

2. What administration/ delivery Paper-pencil, online, or both? Computer adaptive? 

mode(s) do we want or need? 

3. Does the assessment have the Can the test be administered to meet the needs of 

flexibility to be administered at the students with disabilities and English learners? 

group or individual level? 

4. What elements does the system Can the test be administered at grade level only, or 

include that ensure uniform above or below also? 

administration for all students, 
except those receiving appropriate 
accommodation? 

5. What logistics related to the Will we do local scanning? Will we scan at each 

administration of the test can we school or at a central location? What materials are 
support? needed to administer and score the tests? 

6. What technology resources are Do we have the requisite number of computers? 
available and what will be needed? How long do data need to be stored? Will we track 

results over multiple years? 

7. How quickly can the tests be If open-ended items are included, what training will 

scored? be provided to scorers? 

8. How will we score the tests? Can instructional time be used for administration 

and scoring the tests? 
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In this section, “Reporting,” a critical framing question for the conversations can be, “How does reporting support the 
purpose(s) of the interim assessment system?” 



REPORTING 


Question 


Relevance / Follow-up Questions 


Notes / Response 


1. Does the interim assessment 
allow for comparisons to other 
populations? 


Does it allow for district-wide or statewide 
comparisons? 




2. Can the system help determine 
trends regarding systematic gaps or 
what is missing from the prior 
grade? Gaps associated with 
particular teachers? 


Do assessment results seamlessly flow into a 
student information system? Do some stakeholders 
have a concern that results will be used for teacher 
evaluation, and if so, how do we address that 
concern? 




3. How quickly are reports 
generated and returned to schools 
and teachers? 


Are there also parent reports? Reports that 
students can use? 




4. How easy are the reports to 
interpret and use? Is there a good 
representation of the change in a 
student’s skills? 


Are there examples available from other districts in 
our state? What inferences can be drawn from 
scores on the assessment? 




5. Are data reported in terms and 
formats that can be quickly and 
easily understood, disaggregated, 
and utilized? 


What is the mode of reporting? Are there links 
with electronic student-level data systems? Does 
the assessment provide subscores? Is it set at the 
same level of difficulty as the state test? 




6. Who has access to the data? How 
secure are the data? 


What are the privacy masking requirements of the 
assessment data? Are user-level data access rights 
clearly defined and documented? 




7. Can the system effectively track 
students who change schools within 
the district? 


How often are student and teacher rosters 
updated? Can students who change districts be 
tracked if the same interim assessments were used 
in their previous district? 




8. Will the assessment integrate into 


Do the assessment scores flow seamlessly into the 
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REPORTING 


Question 


Relevance / Follow-up Questions 


Notes / Response 


a school’s response to intervention 
(Rtl) framework? 


databases and systems that support Rtl? 
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EVALUATION 


Question 


Relevance / Follow-up Questions 


Notes / Response 


1 . How do we know whether the 
program has worked? 


It helps to establish measurable goals for 
subsequent evaluation. 




2. What is our plan to review the 
effectiveness of this program? 
When will that review take place? 
What should be evaluated during 
that review? 


Clear goals and timelines are a must. 

What is our exit strategy if the evaluation does not 
show that the program is effective? 




3. Since many contracts with 
vendors are on a year-to-year basis 
(due to fiscal concerns), how can we 
tie renewal of the product to 
evaluation results? 


Explore possibilities with vendors. A competitive 
or challenging economic time may present 
unprecedented opportunities for negotiation. 
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Appendix A: Resources 

Crane, E. W. (2008). Interim Assessment Practices and Avenues for State Involvement. Washington, DC: Council of Chief State School 
Officers. Available at http:/ / www.ccsso.org/Publications/Download.cfmPFilenameMnterimAssessmentPractices-web.pdf. 

Perie, M., Marion, S., & Gong, B. (2007). The role of interim assessments in a comprehensive assessment system: A policy brief. Available 
at http:/ /www.nciea.org/publications/PolicyBriefFINAL.pdf. 

Perie, M., Marion, S., & Gong, B. (2009). Moving toward a comprehensive assessment system: A framework for considering interim 
assessments. Educational Measurement: Issues and Practice 28(3), 5—13. 



Other References 

Crane, E. W., Rabinowitz, S., & Zimmerman, J. (2004). Locally tailored accountability: Building on your state system in the era of NCLB. 
[Knowledge Brief] San Francisco: WestEd. Retrieved March 11, 2008, from http://www.wested.org/online_pubs/KN-04-01.pdf. 

Siskind, T. & Potter, D. (2006). To Members of the State Board of Education and Members of the South Carolina Education Oversight 
Committee, memorandum, 9 November 2006, re Adoption List for Formative Assessments. 



Appendix B: Eight uses of interim assessment, excerpted from Crane (2008) 

In all, eight uses of interim assessment were identified: diagnosis, prediction, preparation, placement, student evaluation, school intervention, 
promotion/ graduation, and local accountability. The purposes of interim assessments can stretch from the borders of formative assessment to 
those of summative assessment. For example, interim assessments are used formatively when they are used for diagnosis. At the opposite 
boundary of interim assessments are those with more summative uses, such as promotion/graduation and local accountability. The 
remaining uses are suggestive of more of a “middle ground” of purposes for these tests. The assignment of these uses along a continuum is 
intended as a rough guide; how tests are being used in real instances — the purposes to which the tests are actually put — should drive the 
thinking and any characterization about them. 

At the formative margin-. 
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• Where the goal is identification of weak areas of performance for subsequent remediation, the goal of the interim assessment is 
diagnosis. 

In the interim assessment “middle ground”: 

• Some jurisdictions (and many test publishers) claim that interim assessment provides a good forecast of student performance on 
the high-stakes test. In this case, the test is used for prediction. 

• Where interim assessments are cited as helping students become familiar and comfortable with a test that may mirror the 
mandatory high-stakes test later in the school year, the jurisdiction’s goals include preparation. 

• Some jurisdictions use scores on interim tests for placement , to help inform what courses the student should be taking next. 

• Perhaps the oldest use of interim assessment is for student evaluation , as traditional texts and curricula often have included chapter 
and unit tests that provide an evaluation of student understanding of recently completed material. 

• Some jurisdictions require schools in need of improvement to use interim assessment. In this case, interim assessment serves as 
one initiative in a school intervention strategy. 

At the summative margin: 

• Where the progress shown across interim tests affects students’ passage to the next grade, the tests are used for promotion. 
Likewise, state graduation requirements, such as the body-of-evidence-based approaches of Rhode Island and Wyoming, may 
include demonstrating learning over time through progress on district interim assessments. 

• Interim assessment’s purposes may be directed at the school as well as at students, as school districts may include interim 
assessment data as an input in a system of local accountability (Crane, Rabinowitz, & Zimmerman, 2004). 

Appendix C: Expanded listing of technical features in interim assessments, adapted from the 2007—2008 Formative Assessment 

Adoption List for South Carolina (Siskind and Potter, 2006) 

• experimental or quasi-experimental study design 

• adequate description of links to instruction 

• adequate description of sampling plan 

• indication of study duration 

• adequate description of data analysis, including statistical techniques used 

• adequate description of study’s findings and their practical significance 
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• adequate sample size or repetitions in the study 

• adequate statistical and psychometric information — description of field test sample; reliability indices; standard error for each 
score point; decision consistency indices; item difficulty for each item; item discrimination for each item; DIF statistics for each 
item; evidence of bias and sensitivity review for all items; evidence that each item was edited for spelling, grammar, usage 
conventions, and cuing and item writing principles; information about score derivation; information about the interpretation of 
test scores 



This workbook was prepared by Eric W. Crane, senior research associate at WestEd, in collaboration with the Technical Issues in Earge-Scale Assessment 
(TIES A) collaborative of the Council of Chief State School Officers (CCS SO). 
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